Relu merge optimizer pass #586
base: main
Conversation
… from their own Layer class
…e the relu layer class is not simply called Activation anymore. Need to fix
… layer index error
Hi @oliviaweng thanks a lot for the contribution, it looks great! It seems some tests are failing: https://gitlab.cern.ch/fastmachinelearning/hls4ml/-/pipelines/4158655 I think most of the failures are relatively easy to fix, e.g.:
```python
def match(self, node):
    supported_layers = (Dense, Conv2D, Conv2DBatchnorm)

    is_match = issubclass(node.get_input_node().__class__, supported_layers)
    # ReLU layers are of class Activation
    is_match = is_match and issubclass(node.__class__, Activation)
    return is_match
```
I see. For the missing `CONFIG_T::out_t` error, it looks like this `match()` function is too generous. Since all activation layers are subclasses of `Activation`, any Dense/Conv2D layer that is followed by any activation function returns True, which is wrong. I'll look into tightening this up.
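One way to tighten the match, sketched below, is to also check the node's activation attribute rather than accepting any `Activation` subclass. The stand-in `Layer`/`Dense`/`Conv2D`/`Activation` classes here are minimal mocks for illustration only; they are not the real hls4ml layer classes, and the `get_attr('activation')` check is an assumption about how the activation name is stored.

```python
# Hypothetical sketch: restrict the match so only a ReLU following a
# supported layer qualifies. The classes below are minimal stand-ins
# for the hls4ml layer classes referenced in the review, not real API.
class Layer:
    def __init__(self, input_node=None, attributes=None):
        self._input_node = input_node
        self.attributes = attributes or {}

    def get_input_node(self):
        return self._input_node

    def get_attr(self, name, default=None):
        return self.attributes.get(name, default)

class Dense(Layer): pass
class Conv2D(Layer): pass
class Conv2DBatchnorm(Conv2D): pass
class Activation(Layer): pass

def match(node):
    supported_layers = (Dense, Conv2D, Conv2DBatchnorm)
    is_match = isinstance(node.get_input_node(), supported_layers)
    # Require the node to be an activation layer...
    is_match = is_match and isinstance(node, Activation)
    # ...and specifically ReLU, so softmax/tanh/etc. no longer match.
    is_match = is_match and node.get_attr('activation') == 'relu'
    return is_match
```

With this extra check, a Dense followed by softmax no longer matches, which is exactly the over-generous case described above.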
A question on the approach itself: this will only work for ReLU, and will require duplicate overrides for every possible combination of activation, layer, and HLS implementation if we want to expand it. Instead of extending the kernel of the matrix-vector multiplication to tack on the ReLU computation and then creating duplicate function calls for that, why don't we introduce a new operation, a no-op by default, that sits at the end of the dense function? This probably sounds unclear, so if you have 15 minutes we can chat over Zoom.
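The "no-op by default" idea above can be illustrated with a small Python sketch: the dense kernel takes a post-op applied to each output, the default is identity, and merging an activation just swaps in a different callable instead of duplicating the whole kernel per (layer, activation, implementation) combination. All names here are illustrative, not hls4ml API; the real implementation would be templated HLS C++.

```python
# Hypothetical sketch of a dense kernel with a pluggable post-op.
# By default the post-op is the identity (a no-op); merging ReLU
# means passing a different callable, with no duplicated kernels.
def identity(x):
    return x

def relu(x):
    return x if x > 0 else 0

def dense(weights, bias, inputs, post_op=identity):
    outputs = []
    for w_row, b in zip(weights, bias):
        acc = sum(w * x for w, x in zip(w_row, inputs)) + b
        outputs.append(post_op(acc))  # no-op unless an activation was merged
    return outputs
```

The same pattern extends to other activations (or future fused ops) by supplying a different `post_op`, which is the appeal of this approach over per-combination overrides.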
Should we discuss @vloncar's suggestion? It seems like there is quite a bit of interest in this PR so it will be good to get it in.
It was discussed offline and we converged on the proper approach to this.
Any more news on this?
@abijithYayavaram is currently building a more generic version. We aim to push some updates within the next several weeks.
What is the plan for this in general? Is this where we'll attack with the new code generation framework?
I believe we should revisit this once we have a better way of generating functions in which activations would be merged. There's little gain in general if FIFO depth optimization is applied (and none for io_parallel).
Description
We introduce an hls4ml optimizer pass that merges the ReLU layer into the preceding Dense/Conv2D layer when ReLU immediately follows it, a frequently encountered pattern in neural networks (NNs). NNs in hls4ml are spatially laid out using dataflow stages to implement each layer, linked together by FIFOs. These FIFOs can cost BRAMs, LUTs, and/or FFs. By default in hls4ml, each ReLU is implemented as its own dataflow stage. Because each additional dataflow stage costs extra logic and FIFOs, merging the ReLU activation function into the layer preceding it reduces resource utilization. Although the layers with the newly merged ReLU functionality use more logic than before, there is still a net decrease in resources. This optimization was introduced in hls4ml's MLPerf TinyML Benchmark 2022 submission and written up in this paper, which reports the resulting resource reductions.
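Conceptually, the pass rewrites the layer graph: it removes the ReLU node and records the merged activation on the preceding Dense/Conv2D layer, so the backend emits one dataflow stage instead of two. The dict-based "graph" below is a simplified stand-in for the hls4ml model graph, for illustration only.

```python
# Hypothetical sketch of the merge transform on a toy layer list.
# A ReLU directly following a Dense/Conv2D layer is folded into that
# layer (one dataflow stage, one fewer FIFO); other layers pass through.
def merge_relu(layers):
    merged = []
    for layer in layers:
        if layer['type'] == 'ReLU' and merged and merged[-1]['type'] in ('Dense', 'Conv2D'):
            merged[-1]['merged_activation'] = 'relu'  # fold ReLU into the preceding layer
        else:
            merged.append(layer)
    return merged
```

A model such as Dense → ReLU → Conv2D → Softmax would come out as Dense(+relu) → Conv2D → Softmax, with the softmax left as its own stage since only ReLU is handled by this pass.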
Type of change
This optimization pass was first mentioned in the MLPerf TinyML PR #503.
Tests
This repo contains two test models (a fully-connected NN and a CNN) that can be trained on MNIST, converted into Vivado HLS, and synthesized using Vivado HLS 2020.1.
Checklist